
[Inference Client] Factorize inference payload build #2601

Merged — 14 commits merged into main on Oct 15, 2024

Conversation

@hanouticelina (Contributor) commented:
This PR is a first attempt at factorizing the payload build across multiple InferenceClient methods. Several methods share repetitive logic for handling inputs and parameters, so this PR introduces a new (private) helper function to factor out that logic.

Key changes

  1. Adding a helper function (see the sketch after this section) that:
    • handles both raw content (images or audio) and string/dict inputs uniformly.
    • base64-encodes raw content when at least one parameter is present.
    • filters out None values from the parameters.
    • …and returns an _InferenceInputs object containing the JSON payload and the raw data, if any. (I don't have a strong opinion on this; we could also return a Tuple instead.)
  2. A unit test to verify the correct behavior of the refactored logic.

⚠️ These changes only affect the internal/private functionality of the InferenceClient and AsyncInferenceClient.
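
As a rough illustration of the behavior described above, here is a minimal sketch of such a helper. The names _prepare_payload and _InferenceInputs come up in this review, but the exact signature, fields, and payload shape below are assumptions, not the merged code:

```python
import base64
from dataclasses import dataclass
from pathlib import Path
from typing import Any, Dict, Optional, Union


@dataclass
class _InferenceInputs:
    # Hypothetical container: exactly one of these is expected to be set.
    json: Optional[Dict[str, Any]] = None
    raw_data: Optional[Union[bytes, Path, str]] = None


def _prepare_payload(
    inputs: Union[str, Dict[str, Any], bytes, Path],
    parameters: Optional[Dict[str, Any]],
) -> _InferenceInputs:
    # Filter out None values from the parameters.
    parameters = {k: v for k, v in (parameters or {}).items() if v is not None}
    if isinstance(inputs, (bytes, Path)):
        if not parameters:
            # No parameters: send the raw content as the request body, untouched.
            return _InferenceInputs(raw_data=inputs)
        # At least one parameter: base64-encode the raw content into a JSON payload.
        data = inputs if isinstance(inputs, bytes) else inputs.read_bytes()
        encoded = base64.b64encode(data).decode()
        return _InferenceInputs(json={"inputs": encoded, "parameters": parameters})
    # String or dict inputs go into the JSON payload directly.
    payload: Dict[str, Any] = {"inputs": inputs}
    if parameters:
        payload["parameters"] = parameters
    return _InferenceInputs(json=payload)
```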

@hanouticelina hanouticelina requested a review from Wauplin October 11, 2024 13:35
@HuggingFaceDocBuilderDev

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

@Wauplin (Contributor) left a comment:

Thanks @hanouticelina! I'm sorry, I realized too late that I commented on both _client.py and _async_client.py, but all comments apply to both (since the async client is autogenerated). My main concern is about determining whether a task expects binaries as input or not (see below). Let me know if you have other ideas on how to fix it. I'm only half-happy with the suggested expect_binary: bool solution 😄

Resolved review threads: src/huggingface_hub/inference/_generated/_async_client.py (×2), tests/test_inference_client.py

```python
def is_raw_content(inputs: Union[str, Dict[str, Any], ContentT]) -> bool:
    return isinstance(inputs, (bytes, Path)) or (
        isinstance(inputs, str) and inputs.startswith(("http://", "https://"))
    )
```
@Wauplin (Contributor), commenting on the snippet above:

This is an annoying part 😕 Depending on the context, inputs.startswith(("http://", "https://")) should lead to different behavior:

  • in image_to_text, a URL as input must be passed as post(data=...) so that the URL is loaded and its content sent to the inference server
  • in feature_extraction, a URL as input should be passed as post(payload={"inputs": ...}) => a URL is a special case of string input, but still a valid one

Since _prepare_payload is agnostic of the task, it can't know which case we're in.
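
To make the divergence concrete, here is a toy (purely illustrative) dispatcher showing how the same URL string must be routed differently per task; the post(data=...) / post(payload=...) convention follows this discussion, not a real API:

```python
url = "https://example.com/cat.png"

def route(task: str, inputs: str) -> dict:
    """Toy dispatcher: same URL input, different transport per task."""
    if task == "image-to-text":
        # The URL locates the binary payload: it must be downloaded and its
        # bytes sent as the request body, i.e. post(data=...).
        return {"data": f"<bytes downloaded from {inputs}>"}
    # For feature-extraction and similar text tasks, a URL is just a valid
    # string input and belongs inside the JSON payload, i.e. post(payload=...).
    return {"payload": {"inputs": inputs}}

print(route("image-to-text", url))       # sent as raw request body
print(route("feature-extraction", url))  # sent as {"inputs": url}
```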

@Wauplin (Contributor):

What do you think of modifying the method signature to

```python
def _prepare_payload(
    inputs: Union[str, Dict[str, Any], ContentT],
    parameters: Optional[Dict[str, Any]],
    expect_binary: bool,
)
```

?

For tasks that expect a binary input (image_to_*, audio_to_*), you would pass _prepare_payload(..., expect_binary=True).

This way you could have logic like this:

```python
is_binary = isinstance(inputs, (bytes, Path))

if expect_binary and not is_binary and not isinstance(inputs, str):
    raise ValueError(...)  # should be a binary or at least a string (local path or url)

if expect_binary and not has_parameter:
    return _InferenceInputs(raw_data=inputs)

if not expect_binary and is_binary:
    raise ValueError(...)  # cannot be a binary

# else set as "inputs" in a json payload
...
```
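
Filling in the ellipses, a runnable rendering of that sketch might look like the following (reusing the imports and hypothetical _InferenceInputs from the earlier sketch; the error messages and the final branch are assumptions):

```python
def _prepare_payload(
    inputs: Union[str, Dict[str, Any], bytes, Path],
    parameters: Optional[Dict[str, Any]],
    expect_binary: bool = False,
) -> _InferenceInputs:
    parameters = {k: v for k, v in (parameters or {}).items() if v is not None}
    is_binary = isinstance(inputs, (bytes, Path))
    if expect_binary and not is_binary and not isinstance(inputs, str):
        # Binary tasks accept raw bytes, a local path, or at least a string (path or URL).
        raise ValueError(f"Expected bytes, Path, or str input, got {type(inputs)}.")
    if expect_binary and not parameters:
        # No parameters: forward the raw content (or path/URL string) as-is.
        return _InferenceInputs(raw_data=inputs)
    if not expect_binary and is_binary:
        raise ValueError(f"Unexpected binary input for this task: {type(inputs)}.")
    # Else set as "inputs" in a JSON payload, base64-encoding in-memory binaries.
    # (A fuller implementation would also fetch path/URL strings before encoding.)
    if is_binary:
        data = inputs if isinstance(inputs, bytes) else inputs.read_bytes()
        inputs = base64.b64encode(data).decode()
    payload: Dict[str, Any] = {"inputs": inputs}
    if parameters:
        payload["parameters"] = parameters
    return _InferenceInputs(json=payload)
```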

@hanouticelina (Contributor, Author) replied:

hum yes, you're right! Actually I did not update image_to_text, since it has no parameters and no logic for choosing between sending a JSON payload or raw data:

```python
response = self.post(data=image, model=model, task="image-to-text")
output = ImageToTextOutput.parse_obj(response)
return output[0] if isinstance(output, list) else output
```

but of course your point is totally valid.
Having a flag seems to cover all the cases. In the beginning I thought about having an InputType enum and adding an input_type arg to _prepare_payload() (sketched below for comparison), but it's simpler to just use an expect_binary flag. I don't have a better solution either for now 😕
I'll fix the suggestions and get back to this :)
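
For reference, the enum alternative mentioned above might have looked something like this (hypothetical, never implemented in this PR):

```python
from enum import Enum, auto

class InputType(Enum):
    RAW = auto()   # bytes / local path / URL to be fetched and sent as the body
    TEXT = auto()  # plain string embedded in the JSON payload
    DICT = auto()  # structured input, e.g. for question answering

# Call sites would then pass the task's input kind explicitly:
#   _prepare_payload(image, parameters, input_type=InputType.RAW)
# A single expect_binary flag covers the same cases with a smaller API surface.
```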

Resolved review threads: src/huggingface_hub/inference/_client.py (×3)
@hanouticelina (Contributor, Author) commented:

thanks @Wauplin for the review! I addressed your suggestions and ended up adding an expect_binary flag, as it's the simplest way to handle the special case of image and audio inputs (path, URL, and binary) :)
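
To illustrate the resulting call-site pattern (hypothetical examples, reusing the sketched _prepare_payload from above; the real call sites live in _client.py):

```python
from pathlib import Path

image = Path("cat.png")

# Binary task (image-to-text): bytes, a local path, or a URL string are accepted.
prepared = _prepare_payload(image, parameters=None, expect_binary=True)

# Text task (feature-extraction): a URL here is treated as a plain string input.
prepared = _prepare_payload("https://example.com/doc", parameters={"truncate": True})
```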

@hanouticelina hanouticelina requested a review from Wauplin October 14, 2024 10:48
@Wauplin (Contributor) left a comment:

Looking good! I have a question related to question answering, and then we should be good to merge.

Resolved review threads: src/huggingface_hub/inference/_common.py (×3)
@hanouticelina hanouticelina requested a review from Wauplin October 14, 2024 14:16
@Wauplin (Contributor) left a comment:

Been a bit picky on corner cases here, but I do think it's worth it 😇

Resolved review threads: src/huggingface_hub/inference/_common.py (×2), tests/test_inference_client.py
@hanouticelina hanouticelina requested a review from Wauplin October 15, 2024 10:56
@Wauplin (Contributor) left a comment:

Thanks @hanouticelina! That should make the inference client more reliable on corner cases in the future, and reduce the amount of duplicated code 😄

Resolved review threads: src/huggingface_hub/inference/_client.py, src/huggingface_hub/inference/_generated/_async_client.py
@hanouticelina (Contributor, Author) commented:

thanks @Wauplin! I think we're good to merge this one

@hanouticelina hanouticelina merged commit a4bc2e5 into main Oct 15, 2024
19 checks passed
@hanouticelina hanouticelina deleted the factorize-inference-payload branch October 15, 2024 13:33